Reinforcement learning

Results: 1147



#Item
871 Dynamic programming / Stochastic control / Markov models / Markov decision process / Operations research / Reinforcement learning / Automated planning and scheduling / Travelling salesman problem / Stochastic / Statistics / Markov processes / Probability and statistics

Nearly Deterministic Abstractions of Markov Decision Processes Terran Lane and Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory 200 Technology Square Cambridge, MA 02139 {terran,lpk}@ai.mit.edu

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:51
872 Applied mathematics / Mathematics / Reinforcement learning / Global optimization / Simulation / Algorithm / Dynamic programming / Genetic algorithm / Mathematical optimization / Operations research / Science

Red Cloud with MATLAB case study CAC we enable your success

Source URL: www.cac.cornell.edu

Language: English - Date: 2012-04-23 12:29:10
873 Dynamic programming / Markov processes / Stochastic control / Network theory / Markov decision process / Reinforcement learning / Symbol / Algorithm / Shortest path problem / Statistics / Mathematics / Applied mathematics

Hierarchical Solution of Large Markov Decision Processes Jennifer Barry and Leslie Pack Kaelbling and Tomás Lozano-Pérez MIT Computer Science and Artificial Intelligence Laboratory Cambridge, MA 02139, USA {jbarry,lp

Source URL: people.csail.mit.edu

Language: English - Date: 2012-06-11 20:17:02
874 Markov processes / Stochastic control / Robot control / Reinforcement learning / Q-learning / Markov decision process / Kalman filter / Multi-armed bandit / Machine learning / Statistics / Markov models / Dynamic programming

All learning is local: Multi-agent learning in global reward games Yu-Han Chang MIT CSAIL Cambridge, MA 02139

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:52
875 Mathematics / Nash equilibrium / Strategy / Solution concept / Minimax / Best response / Matching pennies / Q-learning / Reinforcement learning / Game theory / Problem solving / Decision theory

Playing is believing: The role of beliefs in multi-agent learning Yu-Han Chang Artificial Intelligence Laboratory Massachusetts Institute of Technology

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:51
876 Cognitive science / Behaviorism / Educational psychology / Distance education / Reinforcement / E-learning / Practice / Motivation / Educational software / Education / Behavior / Learning

Microsoft Word - TeachingSevereLearningDeficiencies.doc

Source URL: www.dttrainer.com

Language: English - Date: 2011-05-20 13:49:14
877 Computing / Markov processes / Markov models / Equations / Mathematical optimization / Markov decision process / Reinforcement learning / Automated planning and scheduling / Bellman equation / Statistics / Dynamic programming / Control theory

Toward Hierarchical Decomposition for Planning in Uncertain Environments Terran Lane and Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory Cambridge, MA 02139, USA {terran,lpk}@ai.mit.edu

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:51
878 Stochastic control / Reinforcement learning / Markov decision process / Statistics / Dynamic programming / Markov processes

Efficient Distributed Reinforcement Learning Through Agreement Paulina Varshavskaya, Leslie Pack Kaelbling and Daniela Rus Abstract Distributed robotic systems can benefit from automatic controller design and online adap

Source URL: people.csail.mit.edu

Language: English - Date: 2008-09-29 22:10:07
879 E-learning / Reinforcement / Peer-to-peer / Behavior / Learning / Social information processing / Education / Behaviorism / Distance education

Microsoft Word - Evidence Sheet_2014-2015_SH.doc

Source URL: tennessee.gov

Language: English - Date: 2014-07-30 15:25:08
880 Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics

Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling

Source URL: people.csail.mit.edu

Language: English - Date: 2005-11-02 21:38:45